skip to main content


Search for: All records

Creators/Authors contains: "Zhang, Li"

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

  1. Abstract

    Plant diversity effects on community productivity often increase over time. Whether the strengthening of diversity effects is caused by temporal shifts in species-level overyielding (i.e., higher species-level productivity in diverse communities compared with monocultures) remains unclear. Here, using data from 65 grassland and forest biodiversity experiments, we show that the temporal strength of diversity effects at the community scale is underpinned by temporal changes in the species that yield. These temporal trends of species-level overyielding are shaped by plant ecological strategies, which can be quantitatively delimited by functional traits. In grasslands, the temporal strengthening of biodiversity effects on community productivity was associated with increasing biomass overyielding of resource-conservative species increasing over time, and with overyielding of species characterized by fast resource acquisition either decreasing or increasing. In forests, temporal trends in species overyielding differ when considering above- versus belowground resource acquisition strategies. Overyielding in stem growth decreased for species with high light capture capacity but increased for those with high soil resource acquisition capacity. Our results imply that a diversity of species with different, and potentially complementary, ecological strategies is beneficial for maintaining community productivity over time in both grassland and forest ecosystems.

     
    more » « less
    Free, publicly-accessible full text available December 1, 2025
  2. Abstract

    Traits of the spore‐bearing generation have historically provided the basis for systematic concepts across the phylogenetic spectrum and depth of mosses. Whether taxa characterized by a simple sporophytic architecture are closely related or emerged from independent reduction is often ambiguous. Phylogenomic inferences in the Funariaceae, which hold the model taxonPhyscomitrium patens, revealed that several such shifts in sporophyte complexity occurred, and mostly within theEntosthodon‐Physcomitriumcomplex. Here, we report the rediscovery of the monospecific, Himalayan endemic generaBrachymeniopsisandClavitheca, after nearly 100 years and 40 years since their respective descriptions. The genera are characterized by, among other traits, their short sporophytes lacking the sporangial peristome teeth controlling spore dispersal. Phylogenomic inferences reveal thatBrachymeniopsis gymnostomaarose within the clade ofEntosthodons.str., a genus with typically long‐exserted capsules. We therefore propose to transferB. gymnostomato the genusEntosthodon, asE. gymnostomuscomb. nov.Furthermore,Clavitheca poeltii, the sole species of the genus, is morphologically highly similar toE. gymnostomus, and should also be transferred toEntosthodon, but is retained as a distinct taxon,E. poeltiicomb. nov., until additional populations allow for testing the robustness of the observed divergence in costa and seta length between the Nepalese and Chinese populations.

     
    more » « less
    Free, publicly-accessible full text available December 1, 2024
  3. While Chain-of-Thought (CoT) prompting boosts Language Models’ (LM) performance on a gamut of complex reasoning tasks, the generated reasoning chain does not necessarily reflect how the model arrives at the answer (aka. faithfulness). We propose Faithful CoT, a reasoning framework involving two stages: Translation (Natural Language query → symbolic reasoning chain) and Problem Solving (reasoning chain → answer), using an LM and a deterministic solver respectively. This guarantees that the reasoning chain provides a faithful explanation of the final answer. Aside from interpretability, Faithful CoT also improves empirical performance: it outperforms standard CoT on 9 of 10 benchmarks from 4 diverse domains, with a relative accuracy gain of 6.3% on Math Word Problems (MWP), 3.4% on Planning, 5.5% on Multi-hop Question Answering (QA), and 21.4% on Relational Inference. Furthermore, with GPT-4 and Codex, it sets the new state-of-the-art few-shot performance on 7 datasets (with 95.0+ accuracy on 6 of them), showing a strong synergy between faithfulness and accuracy. 
    more » « less
    Free, publicly-accessible full text available November 1, 2024
  4. Free, publicly-accessible full text available December 15, 2024
  5. Recent work has shown that prompting language models with code-like representations of natural language leads to performance improvements on structured reasoning tasks. However, such tasks comprise only a small subset of all natural language tasks. In our work, we seek to answer whether or not code-prompting is the preferred way of interacting with language models in general. We compare code and text prompts across three popular GPT models (davinci, code-davinci-002, and text-davinci-002) on a broader selection of tasks (e.g., QA, sentiment, summarization) and find that with few exceptions, code prompts do not consistently outperform text prompts. Furthermore, we show that the style of code prompt has a large effect on performance for some (but not all) tasks and that fine-tuning on text instructions leads to better relative performance of code prompts. 
    more » « less
    Free, publicly-accessible full text available June 1, 2024
  6. Following a recent phylogenetic study, we here review the circumscription of the grammitid fern genus Oreogrammitis (Polypodiaceae: Grammitidoideae). We propose three new genera Calligrammitis, Devolia, and Glabrigrammitis, to accommodate the three clades resolved outside of the core Oreogrammitis. The taxonomic treatment is presented, and the morphology of each new genus is shown with a color plate. 
    more » « less
    Free, publicly-accessible full text available May 10, 2024
  7. Entities and events are crucial to natural language reasoning and common in procedural texts. Existing work has focused either exclusively on entity state tracking (e.g., whether a pan is hot) or on event reasoning (e.g., whether one would burn themselves by touching the pan), while these two tasks are often causally related. We propose CREPE, the first benchmark on causal reasoning of event plausibility and entity states. We show that most language models, including GPT-3, perform close to chance at .35 F1, lagging far behind human at .87 F1. We boost model performance to .59 F1 by creatively representing events as programming languages while prompting language models pretrained on code. By injecting the causal relations between entities and events as intermediate reasoning steps in our representation, we further boost the performance to .67 F1. Our findings indicate not only the challenge that CREPE brings for language models, but also the efficacy of code-like prompting combined with chain-of-thought prompting for multihop event reasoning. 
    more » « less
    Free, publicly-accessible full text available May 1, 2024
  8. Alba, Mar (Ed.)
    Abstract T cells are a type of white blood cell that play a critical role in the immune response against foreign pathogens through a process called T cell adaptive immunity (TCAI). However, the evolution of the genes and nucleotide sequences involved in TCAI is not well understood. To investigate this, we performed comparative studies of gene annotations and genome assemblies of 28 vertebrate species and identified sets of human genes that are involved in TCAI, carcinogenesis, and aging. We found that these gene sets share interaction pathways, which may have contributed to the evolution of longevity in the vertebrate lineage leading to humans. Our human gene age dating analyses revealed that there was rapid origination of genes with TCAI-related functions prior to the Cretaceous eutherian radiation and these new genes mainly encode negative regulators. We identified no new TCAI-related genes after the divergence of placental mammals, but we did detect an extensive number of amino acid substitutions under strong positive selection in recently evolved human immunity genes suggesting they are coevolving with adaptive immunity. More specifically, we observed that antigen processing and presentation and checkpoint genes are significantly enriched among new genes evolving under positive selection. These observations reveal evolutionary processes of TCAI that were associated with rapid gene duplication in the early stages of vertebrates and subsequent sequence changes in TCAI-related genes. The analysis of vertebrate genomes provides evidence that a "big bang" of adaptive immune genes occurred 300-500 million years ago. These processes together suggest an early genetic construction of the vertebrate immune system and subsequent molecular adaptation to diverse antigens. 
    more » « less
    Free, publicly-accessible full text available May 1, 2024
  9. Entity linking, the task of linking potentially ambiguous mentions in texts to corresponding knowledge-base entities, is an important component for language understanding. We address two challenge in entity linking: how to leverage wider contexts surrounding a mention, and how to deal with limited training data. We propose a fully unsupervised model called SumMC that first generates a guided summary of the contexts conditioning on the mention, and then casts the task to a multiple-choice problem where the model chooses an entity from a list of candidates. In addition to evaluating our model on existing datasets that focus on named entities, we create a new dataset that links noun phrases from WikiHow to Wikidata. We show that our SumMC model achieves state-of-the-art unsupervised performance on our new dataset and on exiting datasets. 
    more » « less
  10. Abstract Background Events of gene fusion have been reported in several organisms. However, the general role of gene fusion as part of new gene origination remains unknown. Results We conduct genome-wide interrogations of four Oryza genomes by designing and implementing novel pipelines to detect fusion genes. Based on the phylogeny of ten plant species, we detect 310 fusion genes across four Oryza species. The estimated rate of origination of fusion genes in the Oryza genus is as high as 63 fusion genes per species per million years, which is fixed at 16 fusion genes per species per million years and much higher than that in flies. By RNA sequencing analysis, we find more than 44% of the fusion genes are expressed and 90% of gene pairs show strong signals of purifying selection. Further analysis of CRISPR/Cas9 knockout lines indicates that newly formed fusion genes regulate phenotype traits including seed germination, shoot length and root length, suggesting the functional significance of these genes. Conclusions We detect new fusion genes that may drive phenotype evolution in Oryza. This study provides novel insights into the genome evolution of Oryza. 
    more » « less